Gang Scheduling Extensions for I/O Intensive Workloads
نویسندگان
چکیده
Scientific applications are becoming more I/O demanding than ever. For such applications, the system with dedicated I/O nodes does not provide enough scalability. Rather, a serverless approach is a viable alternative. However, with the serverless approach, a job’s execution time is decided by whether it is co-located with the file blocks it needs. Gang scheduling (GS), which is used in supercomputing centers to schedule parallel jobs, is completely not aware of the application’s spatial preferences. In this paper, we show that gang scheduling delivers poor performance towards workloads with high I/O intensities (I/O ratio higher than 50%). We propose an I/O-aware extension of gang scheduling, IOGS, which co-locates jobs with their files. While IOGS performs better for high I/O intensity workloads, its performance for workloads with lower I/O intensities is rather poor because of high system fragmentation. Further, we propose an adaptive strategy, adaptive-IOGS, which attempts to combine the advantages of both gang scheduling and GS, and we show that adaptive-IOGS is better than the other two schemes in many scenarios. Finally, we combine process migration techniques with adaptive-IOGS, and propose Migration-IOGS, which is shown to be the best among the four for a wide spectrum of workloads.
منابع مشابه
Implications of I/O for Gang Scheduled Workloads
This paper examines the implications of gang scheduling for generalpurpose multiprocessors. The workloads in these environments include both compute-bound parallel jobs, which often require gang scheduling, and I/O-bound jobs, which require high CPU priority to achieve interactive response times. Our results indicate that an effective interactive multiprocessor scheduler must weigh both the ben...
متن کاملImplications of I / O for Gang
The job workloads of general-purpose multiprocessors usually include both compute-bound parallel jobs, which often require gang scheduling, as well as I/O-bound jobs, which require high CPU priority for the individual gang members of the job in order to achieve interactive response times. Our results indicate that an eeective interactive multiprocessor scheduler must be exible and tailor the pr...
متن کاملImproving Throughput and Utilization in Parallel Machines through Concurrent Gang
In this paper we propose a new class of scheduling policies, dubbed Concurrent Gang, that combines the advantages of gang scheduling for communication and synchronization intensive parallel jobs with the flexibility of a Unix scheduler for sequential and I/O intensive jobs. Besides that, scalability in Concurrent Gang is achieved through the use of a global synchronizer that coordinates the gan...
متن کاملImproving Parallel Job Scheduling Using Runtime Measurements
We investigate the use of runtime measurements to improve job scheduling on a parallel machine. Emphasis is on gang scheduling based strategies. With the information gathered at runtime, we deene a task classiication scheme based on fuzzy logic and Bayesian estimators. The resulting local task classiication is used to provide better service to I/O bound and interactive jobs under gang schedulin...
متن کاملCoexisting Scheduling Policies Boosting I/O Virtual Machines
Deploying multiple Virtual Machines (VMs) running various types of workloads on current many-core cloud computing infrastructures raises an important issue: The Virtual Machine Monitor (VMM) has to efficiently multiplex VM accesses to the hardware. We argue that altering the scheduling concept can optimize the system’s overall performance. Currently, the Xen VMM achieves near native performance...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003